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(54) Filing system and method capable of avoiding filing of identical document data 



(57) A file system includes a processing device that 
processes data for processing with at least one of a cop- 
ying function to read image data of an original document 
and record the read image data on a sheet, a transmit- 
ting function to send and receive image data and/or 
character data via a communication line, and a record- 
ing function to record received image data and/or char- 
acter data on a sheet, and a memory device to store the 
processing data processed by the processing device. 
The file system includes an identity determination de- 
vice to determine an identity between the processing da- 
ta and data stored in the memory device, and a storing 
management device stores the processing data into the 



memory device on the basis of a result of a determina- 
tion made by the identity determination device. The stor- 
ing management device cancels storing the processing 
data into the memory device when the identity determi- 
nation device determines that the processing data is 
identical to data stored in the memory device. The iden- 
tity determination device determines the identity be- 
tween the processing data and the data stored in the 
memory device based upon information of processes 
with which the processing data has been processed with 
the processing device. The information of processes in- 
cludes information of an original document associated 
with the processing data. 
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Description 

[0001] The present invention relates to a file systenn, 
and more particularly to a file system applied to a data 
processing system for copying, facsimile network com- 
munication, printing or otiier data processing, that is ca- 
pable of avoiding filing of identical document data so as 
to make good use of the storage capacity of a memory 
device of the filing system. 

[0002] Conventionally printed documents which are 
important or documents which are possibly used in fu- 
ture have been filed and placed in order on a shelf or 
the like. In an office having an enormous amount of doc- 
uments, however, a wide space is required for keeping 
the documents and in addition, it has taken plenty of time 
to find out a required document, 
[0003] Accordingly, in recent years, with further ad- 
vanced high-speed data processing technologies and 
with a tendency of lowering prices of storage devices, 
there has been proposed what is called a file system for 
reading documents with a scanner and storing the doc- 
uments in a mass storage device. These file systems 
are introduced into not only offices having an enormous 
amount of documents but also into other places. 
[0004] As this type of file systems, there have been 
proposed file systems incorporating features designed 
to provide easy retrievals of related documents by sort- 
ing documents systematically by types in a database 
and more recently various file systems in which availa- 
bility has been improved. For example, in Japanese 
Laid-open Patent Publication No. 5-35737 there is de- 
scribed a file system in which reduced images of stored 
document data are created and displayed in a calendar 
view format, and in Japanese Laid-open Patent Publi- 
cation No. 6-119393 there is described a file system in 
which data is sorted, registered (stored), and retrieved 
in a box, calendar, or card format. Furthermore, in Jap- 
anese Laid-open Patent Publications No. 8-255220 and 
No. 9-128402 there are described file systems in which 
continuity or similarity of document data is analyzed. 
[0005] These conventional file systems, however, re- 
quire reading documents with a scanner and inputting 
information for retrieval, which is time-consuming. 
Therefore, the documents tend to be left for processing 
later just to be piled up. To store these documents in 
order in the file system, it must be first determined 
whether or not the documents need to be stored, and 
then required documents must be read with the scanner 
individually and an input work is necessary for sorting. 
Because of these complicated work for filing, users tend 
to reduce the amount of documents for filing by discard- 
ing documents which are not important. 
[0006] This may cause a problem that some of the dis- 
carded documents are not available when they are 
needed aftenward. 

[0007] Accordingly, when checking whether not each 

document should be stored, the determination is not al- 
ways easy, and the determination work takes a long 



time. Furthermore, documents not required at that time 

may be needed later. 

[0008] Generally, documents stored in a file system 
are those copied for a use in a conference, those sent 
5 or received to or from a customer via a facsimile device, 
or those created by a workstation (WS) or a personal 
computer (PC) and printed out. In other words, docu- 
ments to be stored in the file system have been convert- 
ed to electrical signals and recorded on a recording 
10 sheet once or more times. Additionally documents used 
for a conference or those to be circulated may be copied 
repeatedly at difference time and places. 
[0009] The present invention has been made in view 
of the above-discussed and other problems, and pre- 
^5 ferred embodiments of the present invention provide a 
file system, in which wasteful usage of storage capacity 
of a memory device is avoided by preventing storing of 
identical data in the memory device and in which proc- 
essed data stored in the memory device can be readily 
20 reused when required. 

[0010] According to a preferred embodiment of the 
present invention, a file system includes a processing 
device that processes data for processing with at least 
one of a copying function to read image data of an orig- 
25 inal document and record the read Image data on a 
sheet, a transmitting function to send and receive image 
data and/or character data via a communication line, 
and a recording function to record received image data 
and/or character data on a sheet, and a memory device 
30 to store the processing data processed by the process- 
ing device. The file system further includes an identity 
determination device to determine an identity between 
the processing data and data stored in the memory de- 
vice, and a storing management device stores the 
35 processing data into the memory device on the basis of 
a result of a determination made by the identity deter- 
mination device. The storing management device can- 
cels storing the processing data into the memory device 
when the identity determination device determines that 
40 the processing data is identical to data stored in the 
memory device. The storing management device adds 
link information for relating the processing data, deter- 
mined to be identical to data in the memory device by 
the identity determination device, with the data in the 
45 memory device. 

[0011] According to the invention, the identity deter- 
mination device may determine the identity between the 
processing data and the data stored in the memory de- 
vice based upon information of processes with which the 
50 processing data has been processed with the process- 
ing device. 

[0012] The information of processes may include in- 
formation of an original document associated with the 
processing data. 
55 [0013] The information of an original document may 
include information of a size and a direction of the orig- 
inal document, information as to whether the original 
document has an image on one side or both sides of the 
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original document, or information as to whetlierthe orig- 
inal document is a slieet or book. 
[0014] The identity determination device may deter- 
mine a degree of the identity between the processing 
data and the data stored in the memory device. 
[0015] The identity determination device may deter- 
mine the degree of the identity between the processing 
data and the data stored in the memory device based 
upon a degree of an identity of an image of an original 
document associated with the processing data and/or a 
degree of an identity of the original document. 
[0016] The storing management device adds link in- 
formation for relating the processing data with the data 
stored in the memory device based upon the degree of 
the identity determined by the identity determination de- 
vice. 

[0017] The file system may further include a display 
device to display information, an operation device to in- 
put instructions, and an output management device to 
create specific information for specifying data in the 
memory device to display the specific information on the 
display device so as to be selected by the operation de- 
vice and to read out data which has been specified via 
the selection of the specific information specifying the 
data from the memory device to output the specified da- 
ta to the processing device. The output management 
device displays the specific information of the process- 
ing data to which the link information is added on the 
display device with the degree of the identity being high- 
lighted. 

[0018] The storing management device may display 
In the operation device a message for asking a person 
who processes the processing data about storing of the 
processing data to the memory device. 
[0019] The storing management may further include 
an ID obtaining device to obtain a user ID of a user who 
processes the processing data with the processing de- 
vice and add the user ID obtained by the ID obtaining 
device to the processing data to be stored in the memory 
device, and the identity is determined by the identity de- 
termination device between the processing data and the 
data stored in the memory device, having the same user 
ID. 

[0020] In the file system, the processing device and 
the memory device may be on a substantially same in- 
tranet. 

[0021] According to another embodiment of the 
present invention, the file system may include a first 
memory device and a second memory device to store 
the processing data processed by the processing de- 
vice, and the storing management device may read out 
a given amount of document data from the first memory 
device and transfers the given amount of document data 
to the second memory device when a preset capacity of 
the first memory device is exceeded. 
[0022] According to the present invention, data proc- 
essed by the processing device is stored in the memory 
device on the basis of a determination result as to 



whether or not the data processed by the processing 
device has an identity with data which has already been 
stored in the memory device. If the processing data is 
determined to be identical to data in the memory device, 

5 storing of the processing data to the memory device is 
canceled or aborted, and othenwise, the processing data 
is stored in the memory device. Therefore, the process- 
ing data is not only processed by the processing device 
but also stored in the memory device if the data is not 

10 identical to the stored data, without any works for storing 
the data in the memory device, and further, the storage 
capacity of the memory device is saved by avoiding stor- 
age of the processing data in the memory device when 
the identical data exists in the memory device. 

15 [0023] Further, if the processing data has some iden- 
tity with data in the memory device, the processing data 
processed by the processing device is stored in the 
memory device with link information for relating the 
processing data with the already stored data associated 

20 with the processing data, and specific information of the 
data in the memory device, for example, a thumbnail im- 
age of the data, is displayed in a calendar display format 
with a degree of the identity highlighted. Therefore, 
processing data having a higher degree of the identity 

25 can be easily discriminated from other data so as to be 
selected and is output to a connected processing device 
for processing the data there. 

[0024] Furthermore, the storing management device 
asks a person who processes the processing data with 

30 a processing device about storing of the processing data 
into the memory device, and storing of the processing 
data having an identity with the already stored data is 
canceled only according to an instruction of the person 
who processes the processing data, i.e., only when the 

35 person processing the data with the processing device 
specifies that the storage to the memory means is un- 
necessary Therefore, an automatic storage of the 
processing data is never canceled nor the processing 
data is associated with another data against an opera- 

40 tor's will. 

[0025] A more complete appreciation of the present 
invention and many of the attendant advantages of 
thereof will be readily obtained as the same becomes 
better understood by reference to the following detailed 
45 description when considered in conjunction with the ac- 
company drawings wherein: 

Fig. 1 is a diagram of a file system according to the 
present invention illustrating an outline of the con- 
so stitution of the system; 

Fig. 2 is a block diagram of a processing unit in the 
file system; 

Fig. 3 is a top view illustrating a display device and 
an operation device of the processing unit; 
55 Fig. 4 is a perspective view illustrating a reading de- 
vice of the processing unit; 
Fig. 5 is a perspective side view of the reading de- 
vice; 
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Fig. 6 is a block diagram illustrating a main portion 
constituting a memory unit in the file system; 
Fig. 7 is a flowcliart for an explanation of a part of 
file processing of the file system; 
Fig. 8 is a timing chart for an explanation of adding 
additional data to processing data in the file system; 
Fig. 9 is a timing chart for an explanation of adding 
additional data in a manner different from Fig. 8; 
Fig. 10 is a top view illustrating a part of a display 
operation device of the processing unit for an ex- 
planation of additional data to be processed in the 
file system; 

Fig. 11 is a list for an explanation of the additional 
data for the file processing of the file system; 
Fig. 12 is a list for an explanation of one piece of 
the additional data of the file system; 
Fig. 1 3 is a flowchart for an explanation of a part of 
file processing different from the one in Fig. 7; 
Fig. 14 is a flowchart for an explanation of reusing 
filed processing data in the file system; 
Fig. 1 5 is a display screen for an explanation of ref- 
erencing the filed processing data in the file system; 
Fig. 16 is a partially enlarged view of the display 
screen of the file system; 

Fig. 17 is a diagram for an explanation of referenc- 
ing processing, illustrating lists displayed in the dis- 
play screen after selection of a retrieval button in 
the display screen of Fig. 15; 
Fig. 18 is a diagram for an explanation of reference 
processing different from Fig. 17; 
Fig. 1 9 is a block diagram for an explanation of 
transmitting data to be processed in the file system; 
Fig. 20 is a diagram illustrating an example of a doc- 
ument to be processed in the file system; and 
Figs. 21 (a) and 21 (b) are diagrams for an explana- 
tion of aspects of an image of a document and the 
document for determining a degree of an identity of 
document data. 

[0026] Referring now to the drawings, wherein like ref- 
erence numerals designate identical or corresponding 
parts throughout the several views, preferred embodi- 
ments of the present invention will be described. 
[0027] In Fig. 1, a data management system 10 is con- 
figured to function as a data backup system for backing 
up data which is processed with a processing unit by a 
user and also to function as a file system if the user so 
desires. The data management system 10 includes a 
high level function digital copying machine (MFP: Multi- 
function printer) 11 connected on an intranet in a user's 
office, a server machine 12, a mass storage device 
(IMS: Infinite memory server) 13, a personal computer 
(PC) 14, a connecting device (MFB: Multi-function box) 
15, and a mass storage device (Web IMS) 16 on the 
internet for providing services of a service provider de- 
scribed later. 

[0028] The copying machine 11 includes, as illustrat- 
ed in Fig. 2, a control section 21 which integrally controls 



components of the machine 11, and a display section 
22, an operating section 23, an NCU section (a network 
control unit) 24, a communication control section 25, a 
reader 26, a recorder 27, an image memory section 28, 

5 and an image processing section 29, which are all con- 
nected to the control section 21 via a bus 30. The control 
section 21 executes various types of processing of the 
present invention and various functions described later 
by storing various information such as driving conditions 

10 of the components of the machine 1 1 and management 
data according to a control program read out from a 
ROM (read only memory) by a built-in CPU (central 
processing unit) and by using a RAM (random access 
memory) in which required data is stored for the opera- 

^5 tion. 

[0029] The display section 22 and the operating sec- 
tion 23 are arranged in an operation and display panel 
provided on a top of a front portion of the machine body 
illustrated in Fig. 3. As illustrated in the drawing, a touch 
20 panel display operation LCD (liquid crystal display) 22a, 
a ten key 23b, function keys (F keys) 23c, a start key 
23d, and a stop key 23e are arranged in the operation 
and display panel for input operations of user settings, 
instructions or the like and for displaying various infor- 
ms mation such as driving conditions, a device status, or 
input information. In addition, a slot, which is not shown, 
to set an ID card for reading or writing various informa- 
tion from/to the ID card is arranged in the operation and 
display panel. 

30 [0030] The communication control section 25 is con- 
nected to the NCU section (a network control unit) 24 
for connecting or disconnecting a line by executing giv- 
en line controls when making an outgoing or incoming 
call via a PSTN (public switched telephone network). 
55 The communication control section 25 modulates or de- 
modulates image data or various procedure signals with 
a built-in modem and performs a facsimile network com- 
munication (sending or receiving processing) via the 
NCU section 24. Further the communication control 
40 section 25 is connected to an intranet via an l/F (an in- 
terface) which is not illustrated and performs transmis- 
sion (sending and receiving) of document data, such as 
image data and character data. 
[0031] The reader 26 is configured, as shown in Figs. 
45 4 and 5^ such that a document P is placed with being 
positioned so that an angle of the document matches a 
document position reference 26c formed by an included 
angle of a document scale 26b on a contact glass 26a 
having a large area. The reader 26 reads image data to 
50 be transmitted or copied from the document P with the 
document P being put in closely contact with the contact 
glass 26a by a pressurizing plate 26d, which is provided 
on the contact glass 26a so as to open and close to be 
put in contact with and separated from the document P 
55 Alight beam is emitted from an exposing Iamp26f which 
extends in a horizontal scanning direction on a first car- 
riage 26e. The first carriage 26e moves in a vertical 
scanning direction on the document P which has been 
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set. A reflected light from an Image surface of the doc- 
ument P is deflected by a first mirror 26g and is then 
reversed by a second mirror and a third mirror 26i and 
26j mounted on a second carriage 26h, which moves at 
a half speed of the first carriage 26e for keeping a con- 
stant optical path length L of the reflected light. The re- 
flected light is projected on a CCD (charge coupled de- 
vice) 26m through an imaging lens 26k and the image 
data of the document P is read by a photoelectric con- 
version. It is needless to say that the reader 26 may have 
an automatic document feeder (ADF) for automatically 
conveying documents P set on a document table onto 
the contact glass 26a and for discharging them to an 
output table after reading the documents instead of the 
pressurizing plate so that a plurality of documents P can 
be automatically processed. 

[0032] The recorder 27 records an image on a sheet, 
for example, with 400 dpi density and 256 gradations in 
a known electrophotographic recording method, ac- 
cording to image data which has been read or received 
and stored in bit mapping in the image memory section 
28 including a hard disk unit. While the details are not 
described here, in the known electrophotographic meth- 
od, an electrostatic latent image according to read or 
received image data is formed by optically writing the 
data on a photosensitive body which has been charged 
while being rotated, and then toner is attached to the 
photosensitive body for developing the latent image with 
the toner, and a sheet having an appropriate size for the 
recording image or a specified size is conveyed from a 
feed cassette to transfer the developed toner image 
thereupon and then the sheet carrying the toner image 
is discharged outside the machine 1 1 after the toner im- 
age is fixed. It is needless to say that the recorder 27 
may be an ink jet type, a thermal recording type, or any 
other types, besides the electrophotographic recording 
type. 

[0033] The image processing section 29 compresses 
and encodes image data to be sent, and decompresses 
and decodes received image data. The image process- 
ing section 29 further executes converting processing 
to convert character data (code data) of documents cre- 
ated by a user using the PC 14 into image data by bit- 
mapping the character data in the image memory sec- 
tion 28 as required. The image data compression per- 
formed by the image processing section 29 is intended 
for decreasing an amount of data, and therefore any of 
known methcds may be applied only if both of the cop- 
ying machine 1 1 and the server machine 1 2 can process 
the data. For example, aGBTG (generalized block trun- 
cation coding) method can be applied to a compression 
of a bit map data of 400 dpi with 8 bit per pixel in the 
image memory 28. 

[0034] Accordingly, this copying machine 11 includes 
a processing unit having atransmission function for per- 
forming afacsimile network communication in which im- 
age data is transmitted and a data communication in 
which document data is transmitted between PCs 14, a 



copying function for recording read image data on a 
sheet and outputting the recorded sheet, and a record- 
ing function for recording received document data and 
outputting the recorded sheet, by which it serves as a 
5 facsimile device, a printer or a scanner as well as a cop- 
ying machine. 

[0035] The ID card set in the operating section 23 of 
the copying machine 11 contains information such as, 
telephone numbers for facsimile communication, ad- 
10 dresses of the PC 14, processing conditions such as a 
reduction ratio for copying, a user ID, a user name and 
so forth. 

[0036] The copying machine 1 1 reads the processing 
conditions contained in the ID card when the start key 

^5 23d is depressed after a function is selected by a de- 
pression of an F key 23c of the operating section 23 such 
that the user can use various functions of the machine 
11 easily. Further, the user ID is read from the ID card 
(or the user ID is received with document data for a use 

20 with the PC 14) and management information, such as 
the processing function which has been used or the 
number of processed sheets, is stored in a RAM of the 
control section 21 for each user ID so that it can be used 
for accounting processing. Therefore, when the copying 

25 machine 11 performs desired data processing to docu- 
ment data with the provided functions according to a 
processing instruction (including processing instruc- 
tions from the PC 14 and recording instructions of re- 
ceived document data to be locally processed in the ma- 

30 chine 11) inputted by a user, the copying machine 11 
appropriates and adds the user I D to the processed doc- 
ument data as additional data (specific information) 
without requesting an input of the user ID, when sending 
out the document data to the server machine 12 (de- 

35 scribed later). 

[0037] If the copying machine 11 is operated without 
setting of the IC card therein (without input of the user 
ID), the copying machine 11 reads out a shared user ID 
which has previously been allocated to the copying ma- 

40 chine 11 for use fora shared cost at accounting process- 
ing from a nonvolatile RAM so as to use the shared user 
ID as the user ID and processes the document data as 
shared document data. 

[0038] The connecting device 15 functions as a net- 

45 work hub of terminal devices, such as the copying ma- 
chine 11, the sen/er machine 12, and the PCs 14, and 
constructs a local area network (LAN) environment by 
relaying data communication between the terminal de- 
vices. The connecting device 1 5 further connects to oth- 
50 er local area net works constructing an intranet environ- 
ment. The connecting device 1 5 further connects to the 
internet enabling a user to use various types of informa- 
tion by accessing to a service provider company on the 
internet from the copying machine 11, the server ma- 
ss chine 12, or the PC 14. 

[0039] The PC 1 4 includes a CPU, a memory (ROM, 
RAM, etc.), an I/O (input-output) circuit or the like. The 
PC 14 can be used as a system for performing various 
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types of processing, such as creation of a document or 
an image, by arithmetic operations according to an ap- 
plication program read out from a hard disk unit (a mem- 
ory medium), which is not shown, by operations of a key- 
board or a mouse while viewing a display. Document 
data created by the PC 14 can be printed out with vari- 
ous functions of the copying machine 11 by sending out 
a processing instruction together with a user ID to the 
copying machine 11 or be sent to a facsimile machine 
at an entered destination. Further, the PC 1 4can receive 
image data which is received by the copying machine 

1 1 from a facsimile machine, or image data read by the 
copying machine 11. 

[0040] The mass storage device 16 is connected to 
the internet via a communication control unit (not 
shown) of a service provider. When an access from a 
registered user is received, the communication control 
unit enables the mass storage device 16 to be read or 
written according to a control program read out from a 
memory device by a CPU. When a storage instruction 
is received, the mass storage device 16 stores docu- 
ment data following the instruction, which is associated 
with additional data (specific information), which will be 
described later, as received. When a reference instruc- 
tion for the document data for storing is received imme- 
diately after the access, the mass storage device follows 
the reference instruction. For example, if a transfer in- 
struction is sent for document data whose address is 
specified by specification of the additional data (user ID) 
by the server machine 12, the mass storage device 16 
reads out document data of the address and sends out 
the read document data. 

[0041] As illustrated in Fig. 6, the server machine 12 
includes a PC having a CPU 41 , a memory (ROM, RAM, 
etc.) 42, a hard disk unit (a memory medium) 43, a dis- 
play 44, a touch panel 45, a keyboard 46, a mouse 47, 
an I/O (input-output) circuit 48, a network interface 49, 
and a timer facility 50. The PC can be used like the PC 
14. The server machine 12 executes various types of 
processing of the present invention with integrally con- 
trolling the components 42 to 49 of the server machine 

12 by constructing various drivers, such as a file driver 
51 or a display driver 52 illustrated in Fig. 1 9, according 
to an application program read out from the hard disk 
unit 43 by the CPU 41 . The server machine 1 2 is con- 
nected to the mass storage device 1 3 via the input-out- 
put circuit 48 and to an Ethernet cable constructing an 
intranet via the network interface 49. A nonvolatile RAM 
in the memory section 42 stores data necessary for re- 
ceiving backup services of a service provider with a con- 
nection to the internet, such as an address of the service 
provider, a registered ID (a user ID for receiving the serv- 
ices, which can be identical to a user ID in the ID card 
for using the copying machine 11), and a password, in 
order to receive the services. The network interface 49 
may function as a modem to establish a connection to 
the service provider via a telephone line without using 
the intranet. 



[0042] The server machine 1 2 is configured to receive 

data to be processed by the copying machine 1 1 via the 
intranet and to send the data to the mass storage device 
1 3 at the same time so as to store the data as received 

5 in a memory device of the mass storage device 1 3. At 
this point, a used capacity (a used storage capacity) of 
the mass storage device 1 3 is determined, and when 
the used capacity is found to exceed a predetermined 
amount, a given amount of document data is read out 

10 from the mass storage device 13 sequentially in order 
of age and is transferred to the mass storage device 1 6 
to be stored therein by accessing to the service provider 
on the intemet by using address of the sen/ice provider, 
the registered I D, or the password in the memory section 

^5 42, before or after executing a storage of the document 
data. In addition, according to a request by the user, the 
server machine 1 2 reads out a part of the document data 
stored in the mass storage device 1 3, for example, a 
thumbnail image for the first page of the document data, 

20 or additional data added to the document data, and dis- 
plays the read data on the display 44 in a manner in 
which the user can select desired document data. The 
selected document data is read out from the mass stor- 
age device 13 and may be transferred to the copying 

25 machine 11 so as to be printed and outputted as a hard 
copy of the document data. Thus, the server machine 
1 2 functions as a file unit so that the data management 
system 10 serves also as a file system. When reference 
to document data stored in the mass storage device 16 

30 is required, the server mach ine 1 2 accesses to the serv- 
ice provider on the internet by using the address of the 
service provider, the registered ID, or the password in 
the memory section 42 to process the data in the mass 
storage device 16 in the same manner. In other words, 

55 the server machine 12 functions as a storing manage- 
ment device and an output management device. 
[0043] Specifically when a copying operation is se- 
lected by an operation of the operating section 23 of the 
copying machine 11, as illustrated in a flowchart in Fig. 

40 7, by a depression of the start key 23d directly (Steps 
P1 and P2), for example, the copying machine 11 reads 
and copies document data from a document which has 
been set to the reader 26 (Step P3), and in concurrence 
with this operation, if the ID card is set to the operating 

45 section 23, the copying machine 11 authenticates an op- 
erator (a user of the copying machine 11) based on the 
user ID read out from the ID card (Steps P4 and P5). If 
the user ID cannot be obtained, a shared ID read from 
the nonvolatile RAM of the control section 21 is as- 

50 sumed to be a user ID and the authentication of the op- 
erator is set to "No setting" (Steps P4, P5, and P7). 
[0044] Additionally, concurrently with processing with 
the selected function, the copying machine 11 encodes 
and compresses the same document data by the image 

55 processing section 29, adds processing date and time 
information, timed with a timer facility (not shown), and 
processing conditions (a reduction ratio, etc.), together 
with the user ID, to the document data as additional data 
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(code data), and sends out the encoded and com- 
pressed document data with the additional data to the 
server machine 1 2 so as to be stored (filed) in the mass 
storage device 13 on the intranet (Step P8). In this 
processing, the document data is temporarily stored in 
the image memory section 28 of the copying machine 
1 1 , and is then sent to the server machine 1 2 with being 
synchronized with an FGATE signal indicating an Image 
area. In addition, the additional data is sent to the server 
machine 12 synchronized with a COMM signal indicat- 
ing an information area. The additional data is sent, as 
illustrated in Fig. 8, In a form in which the additional data 
is added only to the first or last document data even if a 
document P ranges over a plurality of pages. Thus, the 
document data and the additional data are associated 
with each other and are integrated to single processing 
so as to save the storage space in the mass storage 
device 1 3 or 1 6. It is needless to say that when process- 
ing conditions are desired to be grasped in more detail, 
such as for example, when the copy density is adjusted 
per page in copying processing, additional data may be 
added to the document data per page to be sent to the 
server machine 12, as illustrated in Fig. 9. 
[0045] Thus, document data which is processed by 
the copying machine 11 is automatically stored in the 
mass storage device 1 3 or 1 6 without a need for special 
input operations (i.e., regardless of a presence or ab- 
sence of a storing instruction input) except for the oper- 
ations for executing its processing, with the additional 
data for specifying the document data automatically 
added (associated) thereto. Even for document data to 
be processed without a user ID, the copying machine 
11 stores the document data in substantially the same 
manner, without requesting an input of a user ID, using 
a shared ID. 

[0046] Subsequently when the IC card is extracted, 
the copying machine 11 determines that the operator 

terminates the processing (Step P9). Also, when detect- 
ing that a preset time is elapsed based on the time reg- 
istered by the timer facility (not shown) for a time period 
from an end of reading the document which has been 
set in the reader 26, the copying machine 1 1 determines 
that the operator has terminated the processing (Step 
11). If either of the conditions is satisfied, the copying 
machine 1 1 clears the user I D for specifying the operator 
who has performed the document data processing and 
sets (authenticates) a shared ID of a default, which is 
intended for use by a user who cannot obtain a user ID, 
as "No setting" of an operator, in order to prevent a dif- 
ferent user from using an identical user ID (Step P12). 
In these Steps P9 and PIT when a start instruction of 
new processing is issued by a depression of another key 
input, such as, for example, the F key 23 or the start key 
23d, before the preset time is elapsed, with the IC card 
being set (Step PIG), the process returns to Step PI, 
keeping the identical user ID, to repeat the same 
processing. 

[0047] Therefore, when the operator changes, a user 



ID is obtained again and thereby the exchange of oper- 
ator is reliably detected and the user I D is correctly add- 
ed to the document data. 

[0048] Furthermore, if an "Undo" button, which is not 

5 shown (different from the "Job recall button" 23f in Fig. 
3) and which is arranged in the operating section (the 
operator panel) 23 for specifying an input of a storage 
inhibition instruction, is depressed between the Step 2 
and the Step 11 (Step PI 00), the copying machine 11 

10 skips the steps of storing the document data in the mass 
storage device 13 or 16, i.e., the Steps P4 - P11, and 
continues only processing of the provided functions in 
the control program. If the "Undo" button is depressed 
after the document data is started to be stored by the 

^5 execution of the Step P8, the document data having 
been stored or under storing processing is invalidated 
for reading and is deleted by deleting the additional data 
of the document data before an execution of the next 
processing instruction, so that storing the document da- 

20 ta is canceled. When the "Job recall button" 23f is de- 
pressed to cancel the instruction of the copying process- 
ing, the same processing is performed as for the depres- 
sion of the "Undo" button. 

[0049] Accordingly, the document data, which is 
25 stored in the mass storage device 1 3 or 16 as backup 

data without a request for an input operation except the 
operations performed by a user to use functions of the 
copying machine 11, can be deleted only by a depres- 
sion of the "Undo" button of the operating section 23 be- 

30 tween the Step P2 and the Step P11 . Therefore, when 
copying a confidential image, for example, it can be eas- 
ily avoided that the data of the confidential image is filed 
in the mass storage device 13 or 16 for reuses. 
[0050] As the additional data to be sent from the cop- 

55 ying machine 11 to the server machine 12, the copying 
machine 11 obtains transmission processing conditions 
for transmission, such as a telephone number and an 
address of a destination, and processing conditions for 
copying (recording), such as conditions related to doc- 

40 ument sheets or recording sheets and conditions related 
to image processing on image data, and then adds 
these conditions to the document data to be stored. The 
additional data can be any information useful for speci- 
fying processing. For copying processing, for example, 

45 the copying machine 11 allows for a user to select fol- 
lowing functions in order to enhance the utility of copying 
processing; a copy density an image processing mode 
(such as, image quality correcting processing, etc.), a 
magnification ratio (for reduction and enlargement), 

50 post-processing of sheets (such as, sorting and stapling 
sheets), both-side copying, divide copying, collect cop- 
ying, adding information of a date, a stamp or a page, 
which is printed on a sheet, and edited copying. The 
copying machine 11 receives (obtains) these image 

55 processing conditions as processing conditions togeth- 
er with the document and sheet conditions, such as the 
number of copying sheets, the document size and the 
direction of the document, which are automatically rec- 
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ognized in a range from the maximum A3 size to the 
minimum B6 size or selected according to settings by 
the user, and adds the processing conditions and the 
documents and sheets conditions to the document data 
to be stored. 

[0051 ] I n th e copying mach ine 11 , if the copy function 
is selected by a depression of the F key 23c instead of 
copying documents is started by depression of the start 
l^ey 23d under the default copying conditions (automatic 
sheet selection, magnification ratio of 100%, automatic 
density, etc.), the copying machine 11 allows a user to 
set or select numeric values of the document and sheet 
conditions and the image processing conditions and 
various modes (corresponding to commands in Fig. 11 ) 
by operations of a screen in Fig. 10 displayed on the 
display operation LCD 22a and the ten key 23b. For ex- 
ample, the number of copying sheets entered from the 
ten key 23b (the number of copies which can be set also 
in default) can be set as additional data, for example, by 
adding registration data for the entered number of cop- 
ying sheets after the command 26H. When the registra- 
tion data is, for example, 1, a command like "26" "00" 
"01" is put as illustrated in Fig. 12. As additional data of 
the document and sheet conditions or the image 
processing conditions entered from the display opera- 
tion LCD 223; automatic density setting data for an au- 
tomatic copy density setting according to an image or 
density setting data for an arbitrary copy density setting 
in one of seven grades is set in a command 33H. Fur- 
ther, document type data for an image type, such as 
characters image, photographs image, and characters/ 
photographs mixed image, is set in a command 38H, 
feeding sheet data for a sheet size and/or direction by 
designation of one of the feed cassettes for sheets to 
be used in a command 31 H, and automatic sheet selec- 
tion mode setting data for automatically selecting a feed 
cassette (sheets) according to a document size and di- 
rection and a magnification ratio for the document in a 
command 32H. Furthermore, magnification mode set- 
ting data for a magnification ratio, such as, a standard 
magnification determined by a document size and a 
sheet size, zooming in units of a percent made by an 
arbitrary input setting, size magnification made by input 
setting of lengths of a document image and a copied 
image, and independent magnification made by input 
setting of magnification ratios different in vertical and 
horizontal directions, is set in a command 35H. Also, 
both-side copying mode setting data for both-side cop- 
ying in which an image on a both-sided or single-sided 
document or two-page spread document is recorded on 
both sides of a sheet is set in a command 27H, divide 
copying mode setting data for divide copying in which 
each image of a both-sided or two-page spread docu- 
ment is recorded on each single side of sheets in a com- 
mand 28H, collect copying mode setting data for collec- 
tively copying a plurality of images in which a plurality 
of document images are collected to a single side or 
both sides of sheets in a command 29H, printing mode 



data for printing additional information such as a 
processing date, a stamp such as an "Urgent" or user 
mark, and the number of pages automatically added to 
a copied image in a command 2AH, and editing mode 

5 data for editing copying, such as, a double copy in which 
identical images are arranged on a single side, a margin 
creation in which margins are left in a center or edge 
portions of book documents, a binding margin creation 
in which a margin is left along a single edge of a sheet, 

10 erasing processing in which only a specified color is 
erased, in a command 34H. 

[0052] On the other hand, the sen/er machine 1 2 has 
a database in which additional data is stored with being 
sectioned for each user ID in the hard disk unit 43 so 

^5 that document data stored in the mass storage devices 
13 and 16 can be easily retrieved. When additional data 
including appendix information, such as a user ID, 
processing date information, processing conditions, and 
a title added to document data received from the PCI 4, 

20 is received together with document data from the copy- 
ing machine 11 via the intranet by an execution of the 
Step P8 in Fig. 7, as illustrated in Fig. 13, the server 
machine 12 stores the received document data in the 
mass storage device 1 3 as backup data of the received 

25 document data and further registers the additional data 
sent from the copying machine 11, such as processing 
date information, processing conditions, a title, or the 
like for specifying document data, in a field prepared for 
each type of additional data in the database of the hard 

30 disk unit 43, while associating them with a user ID, so 
that they can be readily used for retrieving document 
data (Step P31). 

[0053] The GPU 41 of the server machine 1 2 further 
performs document analysis processing, such as, cor- 

55 reefing or complimenting document data, processing of 
discriminating document regions pi to p4 or image re- 
gions p5 and p6 from each other in the document P il- 
lustrated in Fig. 20, in order to obtain additional data for 
further specifying the document data (Step P32). Fur- 

40 thermore, character data of the document data is en- 
coded by being processed with an optical character rec- 
ognition (OCR), and then keywords frequently used in 
the sentences are obtained (Step P34). The keywords 
are then registered in the database so as to be associ- 

45 ated with the stored document data (Step P35). There- 
fore, document data stored in the mass storage devices 
1 3 and 1 6 can be easily specified also according to the 
above described additional data. 
[0054] Accordingly, in the server machine 1 2, if a user 

50 requests to reference document data by entering a user 
ID, the CPU 41 reads out the document data associated 
with the user ID from the mass storage devices 13 and 
1 6 and additional data from the hard disk unit 43 of the 
sever machine 12 and displays them on the display 44 

55 according to the reference instruction. At this point, as 
illustrated in a flowchart in Fig. 14, the CPU 41 creates 
a display screen in a calendar view format 60 which can 
be scrolled at a high or low speed with scroll buttons 59, 
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as illustrated in Fig. 15, and also creates thumbnail im- 
ages 61 as illustrated in Fig. 1 6 by reducing the first pag- 
es of tlie document data and display the thumbnail im- 
ages 61 according to each processing date (processing 
date Information) (Step P51). If there has been issued 
an instruction for displaying related documents de- 
scribed later with relating them with the document data, 
the CPU 41 executes the corresponding processing 
(Steps P52 and P53), and further, if an operating instruc- 
tion for retrieving document data is entered subsequent- 
ly (Step P54) the CPU 41 executes various types of 
processing (Step P55). For example, if a user wants to 
checkthe contents of the document, selected document 
data can be displayed being expanded on the entire dis- 
play 44 by a selection of the corresponding thumbnail 
image 61 with a click of a mouse 47 or with a depression 
of a displayed location of the thumbnail image 61 on the 
touch panel 45. The displayed image can be scrolled 
with manipulation of the scroll button 59. 
[0055] Furthermore, in the server machine 12, a se- 
ries of lists illustrated in Fig. 17 can be sequentially dis- 
played from the list in the upper left of the drawing by 
selecting the retrieval button 64 in the display screen in 
the calendar view format 60 of the display 44, so that a 
user can check the contents of the document data 
processing. When a user wants to check the contents 
of document data of an image which is copied, an addi- 
tional data list 66 is displayed including a title or keyword 
of the document data by selecting a copy button 65, so 
that he or she can check the contents, and further, by 
selecting processing conditions in the list 66, an addi- 
tional data list 67 is displayed, including the number of 
copies, a document type and so forth so as to be 
checked. In this processing, if a user requests to refer- 
ence document data included in an arbitrary period by 
specifying the period as additional data by an input of 
date information, the sen/er machine 12 executes refer- 
ence processing using a calendar on which the period 
is displayed. 

[0056] Additionally if a user requests to reference 
document data by entering a user ID, the server ma- 
chine 12 displays lists illustrated in Fig. 18 in the display 
44 sequentially from the list in the upper left of the draw- 
ing by selecting a narrow-down button 63 illustrated in 
Fig. 15, and displays thumbnail images 61 on the cal- 
endar view 60 so as to be selected, with unnecessary 
document data omitted by selecting a type of the addi- 
tional data. When the narrow-down processing is per- 
formed based on the additional data related to the doc- 
ument, by selecting a document button 68, a mode-set- 
table conditions list 69 is displayed and a condition can 
be specified by clicking a "V" mark in the right column 
of the document size, or the like. In a both-side copying 
mode, for example, the server machine displays Single- 
sided Both-sided, Both -sided -> Both-sided, Both- 
sided for left and right pages, and Both-sided for both- 
side pages so as to be selected, and after selection, in- 
verts the thumbnail images 61 in the calendar view 60 



of the document data associated with the corresponding 
additional data by selection of an execution button 70 
displayed on the same screen. If there are a plurality of 
corresponding document data, the additional data list 66 

5 can also be displayed including a title or a keyword of 
the document data by selecting the retrieval button 64 
and the copy button 65 illustrated in Figs. 15 and 17 in 
the manner as described above. The user can then se- 
lect a desired thumbnail image 61 , and display the de- 

10 sired document data on the entire display 44 by select- 
ing a call button 62. 

[0057] Therefore, when a user desires to retrieve doc- 
ument data processed by the copying machine 11 for 
reusing the document data by selecting a menu for re- 

^5 questing reference, the server machine 12 can display 
thumbnail images 61 of the document data having an 
identical user ID, for example, from the latest one or from 
one at an arbitrary time, in a calendar format. In addition, 
by selecting the thumbnail image 61 of desired docu- 

20 ment data using a mouse, the desired document data 
can be properly read out (the entire document data 
which has already been processed is re-obtained) from 
the mass storage device 1 3. The document data is then 
sent to the copying machine 11 together with the addi- 

25 tional data, and the copying machine 1 1 can restore the 
document data by decoding it using the image process- 
ing section 29 and can record it based on the additional 
data used for image processing. Thus, document data, 
which is stored as backup data when the document data 

30 is processed with a certain processing conditions, can 
be reproduced so as to be available without input oper- 
ations of the processing conditions. 
[0058] Returning to Fig. 14, when desired document 
data is found out, by selecting the corresponding thumb- 

55 nail image 61 with a mouse or on a touch panel and by 
selecting the call button 62 to specify an output destina- 
tion (Step P56), the document data can be properly read 
out (the entire document data which has already been 
processed is re-obtained) from the mass storage device 

40 1 3 SO as to be displayed on the entire display 44 or the 
document data can be restored by decoding it using the 
image processing section 29 and be recorded based on 
the additional data used for image processing by send- 
ing the document data together with the additional data 

45 to the copying machine 11 (Step P57), so that the doc- 
ument data which is stored as backup data when the 
document data is processed can be reproduced to be 
available without any input operations of processing 
conditions. Until a quit button (not shown) is selected, 

50 the process returns to the Step P51 to repeat substan- 
tially the same processing. This processing terminates 
when the quit button is selected (Step P58). The user 
referencing the document data may enter the additional 
data for the image processing from the operating section 

55 23 of the copying machine 1 1 . Further, when a reference 
request is made for document data older than data 
stored in the mass storage device 13, the server ma- 
chine 12 reads out an address of a service provider a 
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registered ID for each user ID, and a password from the 
built-in nonvolatile RAM, accesses to the service pro- 
vider on the Internet to send out a reference instruction 
of the document data, by which the mass storage device 
16 on the Internet can be treated in substantially the 
same manner as the mass storage device 13 for refer- 
encing the stored document data. The server machine 
12 can also reference shared document data which is 
processed without an input of a user ID and processed 
with being associated with a shared ID, according to a 
reference instruction of the shared document data, un- 
der the condition that a user ID used for the copying ma- 
chine 11 is entered, by considering the shared ID for the 
copying machine 11 as a user ID used for the reference. 
[0059] In this processing, the CPU 41 of the server 
machine 1 2 includes an identity determination device 53 
illustrated in Fig. 19, which checks whether or not the 
document data sent from the copying machine 11 has 
some relation to the document data already stored in 
the mass storage device 1 3 or 1 6 being associated with 
the identical user ID in the database. When it is deter- 
mined that the document data has some relation, link 
information is added to both of the document data (in 
other words, extending a relation) before the document 
data is stored. While the CPU 41 functions as the identity 
determination device 53 immediately after the docu- 
ment data is received in this embodiment, the CPU 41 
may be configured to function as the identity determina- 
tion device 53 in a time period such as the night time in 
which the copying machine 11 will not be used. 
[0060] Specifically, returning to Fig. 13, after key- 
words are obtained by applying the OCR processing to 
the document data received from the copying machine 
11 and the keywords are registered to the database 
(Steps P34 and P35), the identity determination device 
53 determines whether or not new document data (new 
document data) has some relation to the already stored 
document data (old document data), for example, 
whether the new document data is a document com- 
pletely identical to the old document data, whether the 
new document data is an updated document which has 
been partially changed from the old document data, or 
whether the new document data is a related document 
having some relation to the contents of the old document 
data (Step P36). 

[0061] If the new document data is determined to be 
completely identical to the old document data as a result 
of the determination (Step P37), the same link informa- 
tion is associated with both of the additional data so as 
to register the additional data to the database and delete 
(cancel) the new document data from the mass storage 
device 13 (Step P38). When the Step P53 in Fig. 14 is 
executed, the thumbnail image 61 is created and dis- 
played in the calendar view 60 for each date using the 
old document data in common and the thumbnail image 
61 for each date blinks (inverted) being synchronized 
with each other to highlight that the document data rep- 
resented by each thumbnail image 61 is an identical 



document. An existence of the identical document is in- 
dicated in the list of the additional data, so that the ad- 
ditional data can be displayed for a check. Therefore, 
the user can reuse the document data in substantially 
5 the same manner by selecting either of the thumbnail 
images and checking the additional data. The above op- 
eration can be also applicable to a case for document 
data not completely identical to the old document data, 
which will be described later. 
10 [0062] If the new document data is determined to be 
updated document data which has been partially 
changed from the old document data (Step P39), docu- 
ment data other than document data in the updated re- 
gion (updated data) is deleted from the mass storage 
^5 device 13 or 16 and only the updated data is stored in 
the mass storage device 1 3 or 1 6 so as to be associated 
with the additional data. Further, updated link informa- 
tion, such as for example, version information, is asso- 
ciated with the additional data of both document data 
20 (Step P40), so that the thumbnail image 61 of the up- 
dated document data is created and displayed by re- 
placing the corresponding region of the old document 
data with updated data. At an execution of Step P53 in 
Fig. 14, the corresponding thumbnail image 61 is invert- 
25 ed in blinking at relatively longer intervals than for the 
identical document data, as the updated region is small- 
er, to highlight a degree of the identity in the calendar 
view 60. 

[0063] If a match is found in a preset or greater 
30 number of keywords between the new document data 
and the old document data and as the result the new 
document data is determined to be related document 

data which has some relation to the old document data 
(Step P41), the related link information is associated 
35 with the additional data of both document data in the 
same manner (Step P42). At an execution of the Step 
P53 in Fig. 14, the corresponding thumbnail image 61 
is inverted in blinking at relatively longer inten/als than 
for the identical document data, because there is less 
40 relation between the new document data and the old 
document data, such as a smaller number of matched 
keywords, to highlight a degree of the identity in the cal- 
endar view 60. 

[0064] Accordingly, in filing document data in the 
45 mass storage device 13 or 16 as a backup file, by elim- 
inating the identical or updated document, the storage 
capacities of the mass storage devices 13 and 16 can 
be prevented from being used wastefully, by which the 
number of document data which can be stored in the 
50 mass storage devices 13 and 16 is increased. In addi- 
tion, document data having an identity can be easily dis- 
criminated from other document data so as to be select- 
ed. 

[0065] Whether the new document data is identical to 
55 the old document data or is updated document data is 
checked by a comparison between the new document 

data and the old document data in units of a page or for 
each block in a page by keeping image data of image 
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regions together with character code data, to which the 
document data has been converted through the OCR 
processing by the CPU 41 , for a fixed period (for exam- 
ple, one month) in the hard 6\sW unit 43. The image data 
is compared after correction or modification, such as 
edge matching. In the comparison for each blocl^, as il- 
lustrated in Fig. 20, for example, if document blocks 
such as p1 to p4 and image blocks such as p5 and p6 
are included in the document, the data is compared for 
respective blocks. If a part of the blocks, for example, 
the block p4, does not match, the block p4 is stored for 
filing as updated data. Furthermore, because the OCR 
processing of the document data may not be perfectly 
performed, when the document data includes only char- 
acters, the document data may be determined to be 
completely identical to the old document data if they 
completely match in the number of characters, positions 
of punctuation marks, and the number of words (includ- 
ing a space between characters in English sentences), 
and be determined to be updated document data if a 
font size or a color specification for characters is differ- 
ent between them or the sentences contain revision 
symbols (specific symbols indicating modifications in 
the sentences). 

[0066] Also, even when new document data is not de- 
termined to be identical or updated document data, the 
new document data is stored for filing as related docu- 
ment data if a match occurs in a preset or greater 
number of keywords between the new document data 
and old document data or their titles are identical. In ad- 
dition, when important sentences In the new document 
data, which may be extracted from the document data, 
for example, in a method disclosed in Japanese Laid- 
open Patent Publication No. 9-34905, are identical to 
those in other old document data, the new document 
data may be stored also as related document data for 
the other old document data containing the identical 
sentences. In this processing, the keywords not includ- 
ed in common in the other old document data may be 
additionally registered (in other words, merged) also to 
the other old document data, so that the other old doc- 
ument data can be retrieved according to the keywords 
not included in the document data. Thus, related docu- 
ment data, which cannot be retrieved based upon the 
keywords included in the document data, can be extract- 
ed according to the added keywords, by which a retriev- 
al efficiency is improved. 

[0067] In addition, link information may be registered 
to the database by moving the thumbnail image 61 in 

the calendar view 60 on top of another with an operation 
of the mouse 47 for the sen/er machine 12 (what is 
called, a drag and drop operation) and inputting the link 
information to the database. When the relation becomes 
unclear after a long elapse of time, link information may 
be registered with the drag and drop operation after 
checking the document data by specifying, displaying 
or recording on a sheet additional data, such as key- 
words or titles of respective document data in a row so 



as to be compared, or by processing the document data 
with the OCR processing and inverting only different 
portions in the OCR processing. 
[0068] As described above, in this embodiment, new 

5 document data processed by the copying machine 11 is 
compared with the old document data, and if an identity 
is found between the new and old document data, they 
are related with each other according to the link infor- 
mation. If the new and old document are identical, stor- 

10 ing the new document data is avoided. When the new 
document data is determined to be updated document, 
only the updated portions are stored. Thus, the storage 
capacities of the mass storage devices 13 and 16 can 
be efficiently used. In addition, for the document data 

^5 having an identity with the old document data, each 
thumbnail image 61 in the calendar view 60 is inverted 
in blinking so as to highlight a degree of the identity by 
which a presence or absence of similar document data 
or duplicated document data or relations in document 

20 data can be easily recognized by dates in the calendar 
view 60 and display formats of the thumbnail images 61 . 
Therefore, the document data having an identity can be 
easily discriminated from other document data so as to 
be selected. 

25 [0069] Accordingly a user can store document data 

to be processed with the copying machinell in a mass 
storage device as a backup file without a need for any 
filing works, and further, the user can easily select and 
usefully reuse the document data without a need for 

30 keeping documents, such as copied materials, in a file. 
[0070] Furthermore, new document data processed 
by the copying machine 11 is stored in the mass storage 
device 13 or 16, or the storage thereof is canceled on 
the condition that the same user ID is used. Therefore, 

35 even when a same document is processed with the cop- 
ying machine 11 or another copying machine on an in- 
tranet by a plurality of users, such as, when materials 
for a meeting are created for distribution by the copying 
machine 11 or by another copying machine on the in- 

40 tranet by a user and the distributed materials are copied 
again by the copying machine 11 by another user, the 
same document data is stored in the mass storage de- 
vice 13 or 16 for respective users and it is prevented 
that storing of the same document data by the another 

45 user is canceled for the reason that they are identical 
document data. In addition, a security of the document 
data can be ensured, because document data stored in 
the mass storage device 1 3 or 16 with a certain user ID 
can be referenced only when the same user ID is used. 

50 [0071] As another aspect of the above embodiment, 
though not illustrated in the drawings, athumbnail image 
61 of document data having an identity may be dis- 
played, for example, in red when the document data is 
identical, and the display color may be made thinner as 

55 a degree of the identity becomes lower. Alternatively 
similar colors may be used to highlight the degree of the 
identity. For example, identical document data may be 
displayed in red, while document data having an identity 
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be displayed in rose or orange. 
[0072] As anotlier aspect of the above embodiment, 
the file system may be configured so that a message is 
displayed in the display 44 asking a user who uses the 
copying machine 11 whether or not document data is 
stored in the mass storage device 13 or 16, or whether 
new document data is determined to be updated docu- 
ment data. This constitution makes it possible for a user 
to cancel a storage of the processed data in the mass 
storage device 13 or 16, or to store document data as 
updated document data only when he or she confirms 
and Instructs that new document data processed by the 
copying machine 11 Is Identical with old document data 
which has been stored or that the new document data 
is updated document data, by which the user can avoid 
an execution of processing against his or her will. 
[0073] Specifically, it is hard to determine the identity 
between new document data and old document data 
without error due to, for example, revision made In a 
document R or dust or the like on the document P. If the 
determination standard is lowered, a lot of document da- 
ta are extracted because of having an identity while if 
the determination standard is raised, not only the deter- 
mination takes a long time, but document data may be 
incorrectly determined to have an identity In spite of be- 
ing identical document data or determined to have no 
identity in spite of being document data having an iden- 
tity 

[0074] Accordingly, when an existence of old docu- 
ment data is Identified as a result of being determined 
to have an identity to new document data at a threshold 
level where there is no possibility of reading and pro- 
ducing dust on the document P in processing with the 
copying machine 1 1 , the sen/er machine 1 2 first creates 
a thumbnail image 61 of new document data and dis- 
plays the thumbnail image 61 so as to be inverted in 
blinking in the calendar view 60 together with the corre- 
sponding old document data in the mass storage device 
1 3 or 1 6. A user may determine whether the new docu- 
ment data is identical to the old document data based 
upon a date in the calendar view 60 and a display format 
of the thumbnail image 61 , or for indistinguishable doc- 
ument data, the user may determine its Identity or ne- 
cessity by displaying the old document data on the dis- 
play 44 by double-clicking the thumbnail image 61 of the 
corresponding old document data. If a delete button (not 
shown) is clicked after clicking the thumbnail image 61 
of the new document data inverted in blinking as a result 
of this determination, the new document data is can- 
celed to be stored in the mass storage device 13 or 16, 
while if a register button (not shown) for independent 
registration, update registration with region specifica- 
tion, or related registration, is clicked after clicking the 
thumbnail image 61, the new document data is stored 
in the mass storage device 1 3 or 1 6 in the same manner 
as for the above embodiment. 

[0075] Accordingly, document data of an original doc- 
ument processed with the copying machine 11 can be 



prevented from being deleted by mistake or from being 
stored as updated document data as the result of being 
incorrectly determined due to dust on the original docu- 
ment, by which the new document data can be appro- 

5 priately stored in the mass storage device 1 3 or 1 6. It is 
needless to say that new document data may be tem- 
porarily stored in the hard disk unit 43 of the sen/er ma- 
chine 1 2 to perform the above identity determination be- 
fore storing the data in the mass storage device 13 or 

10 1 6 and that a thumbnail image 61 of deleted document 
data may be deleted from the calendar view 60. 
[0076] Further, the degree of an Identity of processing 
data processed with the copying machine 11 can be de- 
termined based upon a degree of an identity of an image 

^5 of an original document and the original document itself, 
from which the processing data has been obtained with 
the copying machine 1 1 . 

[0077] More specifically, as illustrated in Fig. 21 (a), 
the identity of the Image of the original document may 
20 be determined by evaluating the identity of such aspects 

of the image as, for example, the arrangement of image 
portions and character portions, the ratio between the 
image portions and the character portions, and respec- 
tive contents of the image and character portions. The 

25 Image portions can be evaluated by evaluating such as- 
pects as, for example, the arrangement of images, the 
colors of the images, and the character portions can be 
evaluated by evaluating aspects, such as for example, 
the arrangement of characters, colors of the characters, 

30 the number of the characters, the number of punctua- 
tion, the kind of fonts, each aspect weighted as illustrat- 
ed in the drawing. The words which are extracted by the 
OCR processing are not used in determining the degree 
of the identity and used only for determining if the doc- 

55 ument is identical. 

[0078] Further, the identity of the original document 
itself may be determined by evaluating the identity of 
such aspects of the original document itself as, for ex- 
ample, the size, the direction, whether one-sided or 

40 both-sided, and whether sheet or book, as illustrated in 
Fig. 21(b). 

[0079] Each aspect is given an evaluation value and 
the aggregate amount is given as the identity evaluation 
value for determining a degree of the identity. For ex- 

45 ample, assuming that the maximum aggregate value of 
100 represents the complete identity, when the aggre- 
gate amount is between 0 - 40, it is determined that the 
data has no identity, when the amount is between 40 - 
95, the data has an identity, and when the amount ex- 

50 ceeds 95, the data is identical. 

[0080] The identity of data may be determined based 
upon a result of evaluating all of the aspects of both an 
image of an original document and the original docu- 
ment itself as above, or, for making the determination in 

55 a simple manner or fast, based upon a result of evalu- 
ating only either the aspects of an image of an original 
document or those of the original document itself, or 
based upon a result of evaluating selected aspects of 
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an image of an original documents and/or the original 

document itself. 

[0081] The result of the above determination can be 
informed to the operator, for example, by displaying the 
above aggregate number or a graph representing the 
number in the display 44, or by changing the display of 
related thumbnail images 61 according to the degree of 
the identity in substantially the same manner as de- 
scribed above, such that the operator can determine 
whether to store the data or delete the data with adding io 
the link information. 

[0082] When the data is determined as identical, the 
data is not stored with the link information added thereto, 
when the data is determined not to have an identity, the 
data is stored, and when the data is determined to have 15 
an identity, the image data may be either stored or de- 
leted with the link information added thereto. 
[0083] While the above embodiment has been de- 
scribed for processing with the use of the same user ID, 
it is needless to say that a user I D may be used only for 20 
using the copying machine 11 or for permitting docu- 
ment data processing with the copying machine 1 1 and 
all document data can be reused without using a user I D. 
[0084] In addition, it is needless to say that document 
data may be displayed either as a thumbnail images 61 2S 
in the calendar view 60, a list in order of an identity de- 
gree beginning with the highest one with highlighting, or 
the like, or a combination of them. 
[0085] According to the present invention, processing 
data processed by a processing unit is prevented from 30 
being stored in a memory device if it is determined to be 
identical with stored data based on their identity, by 
which a storage capacity of the memory device can be 
used efficiently. If the processing data is determined to 
have an identity, it can be stored with link information 35 
added so as to be related with stored data, by which, for 
the processing data to which the link information is add- 
ed, specific information for specifying the data, for ex- 
ample, reduced images (thumbnail images) to be dis- 
played in a calendar format having an identity are high- 40 
lighted so as to indicate their identity degree, and there- 
fore the processing data having an identity can be easily 
discriminated from other processing data so as to be se- 
lected. 

[0086] Further, a user can store processing data to be 45 
processed as a backup file without a need for any filing 
works so that the data can be usefully reused. There- 
fore, for example, when copied materials have been 
lost, desired processing data can be easily selected for 
reuse. SO 
[0087] Furthermore, by storing or canceling process- 
ing data or by relating it with other processing data for 
each user ID, the processing data can be stored for each 
user and a security of the processing data is ensured. 
Additionally, by a user's confirmation and specification 55 
of storing processing data, incorrect deletion of docu- 
ment data or incorrect relating of document data with 
each other can be avoided. As a result, a useful file sys- 



tem is provided. 

[0088] Numerous additional modifications and varia- 
tions of the present invention are possible in light of the 
above teachings. It is therefor to be understood that 
5 within the scope of the appended claims, the present 
invention may be practiced othenwise than specifically 
described herein. 



Claims 

1. A file system, comprising; 

processing means for processing data for 
processing with at least one of a copying func- 
tion to read image data of an original document 
and record the read image data on a sheet, a 
transmitting function to send and receive image 
data and/or character data via a communica- 
tion line, and a recording function to record re- 
ceived image data and/or character data on a 
sheet; 

memory means for storing the processing data 
processed by the processing means; 
identity determination means for determining 
whether the processing data is identical to data 
stored in the memory means; and 
storing management means for storing the 
processing data into the memory means on the 
basis of a result of a determination made by the 
identity determination means, 
wherein the storing management means does 
not store the processing data into the memory 
means when the identity determination means 
determines that the processing data is identical 
to data stored in the memory means. 

2. A file system according to claim 1 , wherein the stor- 
ing management means adds link information for 
relating processing data, that has been determined 
to be identical to data in the memory means by the 
identity determination means, with the identical da- 
ta in the memory means. 

3. A file system according to claim 1 or 2, wherein the 
identity determination means determines the iden- 
tity between the processing data and the data 
stored in the memory means based upon informa- 
tion of processes with which the processing data 
has been processed with the processing means. 

4. A file system according to claim 3, wherein the in- 
formation of processes includes information of an 
original document associated with the processing 
data. 

5. A file system according to claim 4, wherein the in- 
formation of an original document includes informa- 



13 



25 



EP 0 980 178 A2 



26 



tion of the size and orientation of the original docu- 
ment. 

6. A file system according to claim 4 or 5, wherein the 
information of an original document includes infor- 
mation as to whether the original document has an 
image on one side or both sides of the original doc- 
ument. 

7. A file system according to claim 4, 5 or 6, wherein 
the information of an original document includes in- 
formation as to whether the original document Is a 
sheet or book. 

8. A file system according to any one of the preceding 
claims, wherein the identity determination means 
determines a degree of the identity between the 
processing data and the data stored in the memory 
means. 

9. A file system according to claim 8, wherein the iden- 
tity determination means determines the degree of 
the identity between the processing data and the 
data stored in the memory means based upon a de- 
gree of an identity of an Image of an original docu- 
ment associated with the processing data and/or a 
degree of an identity of the original document. 

1 0. A file system according to claim 9, wherein the stor- 
ing management means adds link Information for 
relating the processing data with the data stored in 
the memory means based upon the degree of the 
identity determined by the Identity determination 
means. 

11. A file system according to claim 10, further compris- 
ing; 

display means for displaying information; 
operation means for inputting instructions; and 
output management means for creating specif- 
ic information for specifying data In the memory 
means, for displaying the specific information 
on the display means so as to be selectable by 
the operation means and for reading out data 
which has been specified via the selection of 
the specific information specifying the data 
from the memory means to output the specified 
data to the processing means, 
wherein the output management means dis- 
plays the specific information of the processing 
data to which the link information is added on 
the display means with the degree of the iden- 
tity being highlighted. 

12. A file system according to claim 11, wherein the 

storing management means displays in the opera- 
tion means a message for asking a person who 



processes the processing data about storing of the 
processing data to the memory means. 

13. A file system according to any of the preceding 
5 claims, wherein the storing management means in- 
cludes ID obtaining means for obtaining a user ID 
of a user who processes the processing data with 
the processing means and adds the user ID ob- 
tained by the ID obtaining means to the processing 

10 data to be stored in the memory means, and the 
identity is determined by the identity determination 
means between the processing data and the data 
stored In the memory means, having the same user 
ID. 

15 

1 4. A file system according to any one of the preceding 
claims, wherein the processing means and the 
memory means are connected to substantially the 
same Intranet. 

20 

15. A file system, according to any of the preceding 

claims, wherein: 

said memory means comprises first and sec- 
25 ond memory means for storing processing data 

processed by the processing means; 
said identity determination means is for deter- 
mining an identity between the processing data 
and data stored in the first or second memory 
30 means; 

said storing management means is for storing 
the processing data Into the first memory 
means on the basis of a result of a determina- 
tion made by the identity determination means 
35 and the storing management means reads out 

a given amount of document data from the first 
memory means and transfers the given amount 
of document data to the second memory means 
when a preset capacity of the first memory 
40 means is exceeded. 

16. A filing system according to claim 15, wherein the 
second memory means is on an internet. 

45 17. A file system according to claim 15 or 16 when de- 
pendent on claim 11 , wherein the storing manage- 
ment means displays In the operation means a mes- 
sage for asking a person who processes the 
processing data about storing of the processing da- 

50 ta to the first memory means. 

18. A method of filing data, comprising steps of: 

processing data for processing with at least one 
55 of a copying function to read image data of an 

original document and record the read Image 

data on a sheet, a transmitting function to send 
and receive image data and/or character data 
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via a communication line, and a recording func- 
tion to record received image data and/or cliar- 
acter data on a sheet; 

storing the processing data into a memory de- 
vice; 

determining whether the processing data is 
identical to data stored in a memory device; and 
canceling the storing of the processing data in- 
to the memory device when the processing da- 
ta is determined to be identical to data stored 
in the memory device at the identity determina- 
tion step. 

19. A method according to claim 18, wherein the iden- 
tity between the processing data and the data 
stored in the memory device is determined based 
upon information of processes with which the 
processing data has been processed at the 
processing step. 

20. A method according to claim 1 8 or 1 9, further com- 
prising a step of: 

adding link information for relating the 
processing data, determined to be identical to data 
in the memory device at the determination step, with 
the data in the memory device. 

21. A method according to claim 18, 19 or 20, wherein 
a degree of the identity between the processing da- 
ta and the data stored in the memory device is de- 
termined at the identity determination step. 

22. A method according to claim 21 , wherein the degree 
of the identity between the processing data and the 
data stored in the memory device is determined 
based upon a degree of an identity of an image of 
an original document associated with the process- 
ing data and/or a degree of an identity of the original 
document. 

23. A method according to claim 22, further comprising 
the step of adding link information for relating the 
processing data with data stored in the memory de- 
vice based upon the degree of the identity deter- 
mined at the identity determination step. 

24. A method according to claim 23, further comprising 

steps of; 

creating specific information for specifying data 
in the memory device and displaying the spe- 
cific information on a display device so as to be 
selected by an operation device; 
displaying the specific information of the 
processing data to which the link information is 
added on the display device with the degree of 
the identity being highlighted; and 
reading out data which has been specified via 



the selection of the specific information speci- 
fying the data from the memory device to output 
the specified data. 

5 25. A method according to any one of claims 1 8 to 24, 
further comprising a step of: 

displaying a message for asking a person who 
processes the processing data about storing of the 
priocessing data to the memory device. 

10 

26. A method according to any one of claims 1 8 to 25 , 
further comprising a step of: 

obtaining a user ID of a user who processes the 

processing data: and 

adding the user ID obtained at the ID obtaining 
step to the processing data to be stored in the 
memory device, 

wherein, the identity is determined at the iden- 
tity determination step between the processing 

data and the data stored in the memory device, 
having the same user ID. 

27. A method according to any one of claims 18 to 26, 
wherein said memory device comprises first and 
second memory devices; 

in said storing step the processing data is 
stored into the first memory device; and 
in said determining step, an identity between 
the processing data and data stored in the first 
or second memory devices is determined; and 
further comprising the step of reading out a giv- 
en amount of data from the first memory device 
and transferring the given amount of data to a 
second memory device when a present capac- 
ity of the first memory device is exceeded. 
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